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Abstract 

Inai+x2x2x2  randomized  groups  design,  a  study  was  made 
of  correction,  rerun  correction,  noncorrection  and  rerun  noncorrection 
training  methods  in  the  acquisition  and  extinction  of  a  position 
response  and  brightness  discrimination  in  a  parallel  alley  maze  and 
T-maze  using  preference  and.  nonpreference  conditions.  Four  depen¬ 
dent  variables  were  rigorously  def ined--trials,  errors,  correct 
responses,  and  runs.  In  acquisition,  the  factors--training  methods, 
tasks,  and  preference  were  found  significant.  Extinction  results 
showed  no  significant  differences.  Learning  occurred  for  all 
Ss  with  Ss  in  NC  requiring  fewer  reinforcements  to  reach  the 
criterion.  It  was  generally  concluded  that  the  use  of  one  of 
the  noncorrection  procedures  will  likely  give  more  efficient 
learning  of  similar  tasks  in  terms  of  the  defined  dependent 
variables.  Rigorous  definition  of  methods  and  variables  will 


permit  comparison  between  and  in  future  studies. 
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Introduction 


During  the  past  two  decades  there  have  been  numerous  studies 
in  different  areas  of  maze  learning  (e.g.,  acquisition  of  a  position 
response,  a  discrimination  task)*  Within  these  areas,  various 
training  methods  have  been  used  and  often  different  results  have 
occurred  with  similar  problems.  There  have  been  four  basic  training 
methods  used  when  a  S  makes  an  error  in  learning  a  maze.  Usually 
only  two  of  these  have  been  compared  in  any  one  study.  An  exception 
to  this  is  the  series  of  studies  by  Kalish  (19^6).  However,  although 
all  four  methods  were  compared,  this  was  not  accomplished  in  one 
all  encompassing  experiment.  Furthermore,  there  has  been  only  one 
study  (Towart  &  Boe,  1965)  in  which  explicit  definition  of  the 
four  methods  has  been  made.  In  this  study,  these  methods  were 
defined  as  follows: 

1.  Correction  method  (C).  Following  an  error,  the  S  returns 
to  the  choice  point  and  makes  another  choice.  A  trial  is  completed 
after  the  correct  choice  is  made. 

2.  Rerun  correction  method  (R-C).  Following  an  error,  the  S 
returns  to  the  choice  point  and  makes  another  choice;  when  the  correct 
choice  is  made,  the  S  is  returned  to  the  start  point  and  another  run 
commences.  The  first  errorless  run  completes  a  trial. 

3.  Noncorrection  method  (NC).  Following  an  error,  the  S  is 
prevented  from  returning  to  the  choice  point.  A  trial  is  completed 
upon  the  commission  of  an  error  or  correct  choice. 

b.  Rerun  noncorrection  method  (R-NC).  Following  an  error,  the 
S  is  prevented  from  returning  to  the  choice  point,  and  is  returned 
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to  the  start  point  where  another  run  commences.  The  first  errorless 
run  completes  a  trial. 

Two  investigations  (Muenzinger  &  Powloski,  1951;  Towart  &  Boe, 
1965)  compared  C  with  R-NC  which  is  relevant  to  an  evaluation  of  C 
because  the  number  of  reinforcements  is  the  same  in  both  instances. 
Comparisons  between  C  and  NC  (Besch,  Morris,  &  Levine,  1963;  Hull 
&  Spence,  1938;  North,  1950a;  North?  1950b;  Parducci  &  Polt,  1958; 
Seward,  19^-3)  have  shown  learning  in  both  groups  even  though  Ss 
in  C  received  a  larger  number  of  reinforcements  than  those  in  NC. 

Kalish  (19^6)  compared  the  four  methods  of  training  with  rats 
in  an  H-maze.  A  S  started,  for  example,  at  one  end  of  the  left 
section  and  was  required  to  run  to  the  opposite  end  of  the  right- 
section.  The  start  point  was  varied  from  the  upper  to  the  lower 
end  of  the  left  section.  No  evidence  of  learning  the  correct 
turn  at  the  intersection  of  the  cross  section  and  the  right 
section  was  evident  in  the  C  group;  evidence  of  learning  in  the 
NC  group  was  found  after  many  trials;  the  criterion  of  learning 
was  met  by  Ss  in  the  R-C  and  R-NC  groups,  with  the  R-NC  group 
learning  in  fewer  trials  and  with  fewer  errors. 

In  the  comparison  of  C  with  R-NC,  Muenzinger  and  Powloski  (1951) 
found  that  rats  acquired  a  light/dark  discrimination  in  a  T-maze 
significantly  faster  by  C.  This  result  is  opposite  to  that  found 
by  Kalish  (19^6)  with  rats  under  different  conditions,  e.g.,  task 
and  apparatus  differences.  The  investigations  carried  out  by  Towart 
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and  Boe  (1965)  showed  no  significant  differences  between  C  and  R-NC 
in  the  acquisition  of  a  position  response  in  a  parallel  alley  maze. 
However,  in  a  series  of  six  position  reversals,  R-NC  was  found  to 
be  superior  to  C.  It  was  tentatively  concluded  that  the  superiority 
of  one  method  over  the  other  during  acquisition  depends  on  the 
nature  of  the  task. 

Kalish's  (19^-6)  finding  that  NC  is  superior  to  C,  in  spite  of 
fewer  reinforcements,  was  also  observed  in  rats  learning  a  position 
response  in  a  T-maze  by  Seward  (19^3 )>  in  rats  trained  in  a  T-maze 
to  different  reinforcement  schedules  by  Parducci  and  Polt  (1958), 
and  in  rats  learning  a  drive  discrimination  in  a  T-maze  by  Besch  et  al 
(1963).  However,  Hull  and  Spence  (1938)  found  little  difference  in 
terms  of  errors  between  C  and  NC  during  acquisition  of  a  position 
response  by  rats  in  a  T-maze  although  the  NC  group  acquired  the 
habit  with  fewer  reinforcements.  No  tests  of  significance  were  reported 
for  these  data,  however.  Then,  after  a  standard  number  of  trials  for 
all  Ss,  the  position  response  was  reversed.  The  results  showed  fewer 
errors  during  reversal  learning  for  the  C  group,  although  the  NC  group 
reached  the  same  level  of  performance  with  fewer  reinforcements.  In 
a  series  of  k-Q  reversals  of  a  position  response  in  a  T-maze,  North 
(1950a;  1950b)  found  no  significant  difference  in  errors  between  C 
and  NC  groups  during  reversal  learning.  The  C  group  received  about 
half  again  as  many  reinforcements  as  the  NC  group.  The  position 
response  was  reversed  after  a  set  number  of  trials,  not  after  a 
performance  criterion  had  been  attained.  In  general,  the  results 
of  these  experiments  showed  that  NC  vields  more  rapid  learning  of  a 
single  choice  point  maze  by  rats  than  C. 
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Inasmuch  as  the  experiments  (Kalish,  1946;  Muenzinger  &  Powloski, 
1951;  Towart  &  Boe,  1965)  comparing  C  with  R-NC  produced  conflicting 
results  and  yet  were  similar  in  most  respects  except  for  task  and 
apparatus  (Kalish- -delayed  reaction  problem  in  H-maze;  Muenzinger  and 
Powloski --brightness  discrimination  in  T-maze;  Towart  and  Boe --position 
response  in  parallel  alley  maze)  it  is  evident  that  further  study  is 
required  before  any  general  statement  of  their  relative  merits  can 
be  made.  Furthermore,  Towart  and  Boe  (1965)  suggested  that  systematic 
study  of  the  influence  of  training  methods  could  be  of  considerable 
importance  in  the  selection  of  training  method  in  learning  experi¬ 
ments.  It  has  been  demonstrated  in  several  pairs  of  experiments 
(e.g.,  Blodgett  &  McCutchan,  1944  vs.  Kalish,  1946;  Wischner,  1947 
vs.  Muenzinger  &  Powloski,  1951)  that  very  different  conclusions  can 
be  reached  about  the  same  problem  when  different  methods  of  training 
are  used. 

The  purpose  of  this  study,  therefore,  was  to  experimentally 
compare  the  four  methods  of  training  in  two  apparatus  designs  (the 
parallel  alley  and  T-mazes)  and  in  two  learning  situations  (acquisi¬ 
tion  of  a  position  response  and  a  brightness  discrimination).  The 
influence  of  training  to  preference  or  nonpreference  was  introduced 
as  a  factor  which  may  affect  the  efficiency  of  a  training  method 
under  certain  conditions.  Thus  several  questions  were  considered: 

1.  Is  there  a  difference  between  training  methods? 

2.  Does  apparatus  design  influence  the  efficiency  of 


training  methods? 


. 
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3*  ijo  training  methods  differ  in  efficiency  fur  different 
tasks? 

4.  Does  the  number  of  reinforcements  received  by  the  Ss 
influence  results? 

5.  What  effect  does  training  to  a  preference  or  a  nonpreference 
have  on  results?  Is  there  any  interaction  with  training 
methods? 

6.  If  a  difference  between  training  methods  is  attained  in 
acquisition,  will  the  extinction  of  this  response  show 
similar  differences? 

In  the  latter  problem  NC  was  the  extinction  method  because  essentially 
all  Ss  would  be  using  this  during  the  last  two  days  of  acquisition 
(criterion  days).  The  present  investigation  extends  the  conditions 
under  which  the  four  methods  of  training  have  been  compared. 
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Method 


Subjects 

The  Ss  were  96  experimentally  naive  male  albino  rats  from  a 
local  supplier  (N.  A.  T.  Lab  Co.)  approximately  90  days  old  at  the 
beginning  of  the  experiment.  All  Ss  were  individually  caged  in  the 
experimental  room  under  constant  artificial  illumination. 

Apparatus 

One  apparatus  was  the  parallel  alley  maze  illustrated  in  Fig.  1 
and  scale  drawing,  Fig.  2.  It  was  constructed  of  wood,  painted  flat 
black,  with  hinged  lids  of  hardware  cloth  on  top  and  lights  shielded 
by  "flashed  opal"  glass.  The  second  apparatus  was  the  T-maze  illus¬ 
trated  in  Fig.  3  and  scale  drawing.  Fig.  k.  This  was  built  to  the 
same  specifications  as  the  parallel  alley  maze  and  constructed  of 
the  same  materials.  The  only  major  difference  between  the  two  mazes 
was  the  position  of  the  goal  boxes  in  relation  to  the  choice  compart¬ 
ment.  In  the  T-maze,  the  alleys  to  the  goal  boxes  were  at  l80 
degrees  to  each  other.  A  glass  furniture  coaster  in  each  goal  box 
served  as  water  cups. 

The  E  observed  the  Ss '  behavior  and  operated  the  guillotine 
doors  in  each  maze  from  behind  a  partition  which  prevented  his 
movements  from  being  seen  by  the  Ss. 

Design 

The  study  used  a  k  x  2  x  2  x  2  randomized,  groups  design  with 
three  replications.  The  32  Ss  in  each  replication  were  randomly 


Fig.  2 


Scale  drawing  of  the  parallel  alley  maze.  SB 
start  box;  GB  =  goal  box;  d  =  sliding  door. 
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p-i g<  2.  The  T-rnaze  showing  "the  shield  behind  which  the  E  operated  the  guillotine  doors. 
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Fig.  k.  Scale  drawing  of  the  T-maze.  SB  =  start  box;-GB  =  gos.1  box;  d  =  sliding  doors. 
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assigned  to  each  of  the  treatments.  There  were  four  training  groups 
(T):  C,  the  correction  method  group;  R-C,  the  rerun  correction  method 
group;  NC,  the  noncorrection  method  group;  and  R-NC,  the  rerun  non- 
correction  method  group.  Each  group  had  eight  Ss  which  were  further 
randomly  assigned  to  one  of  the  two  mazes  (D) --parallel  alley  or  T. 

The  four  Ss  in  each  of  these  groups  were  then  randomly  assigned  to 
either  the  spatial  position  problem  (A^)  or  the  discrimination  problem 
(A2).  As  a  result,  two  Ss  were  in  each  condition  and  one  of  the  two 
was  trained  in  the  evening  and  the  other  in  the  morning.  All  Ss  were 
then  tested  for  preference  in  the  pretraining  period.  That  is,  _Ss 
in  the  position  group  were  given  ten  free  nonreinforced  trials  to 
assess  their  preference  for  the  right  or  left  goal  box  and  similarly, 
the  Ss  in  the  discrimination  group  were  given  ten  free  trials  to 
show  their  preference  for  the  goal  box  with  or  without  the  added 
light.  Half  the  Ss  were  then  assigned  to  the  condition  of  training 
to  their  preference  and  half  to  their  nonpreference. 

Each  S  was  given  five  trials  per  day  and  the  criterion  of  learning 
was  set  at  two  consecutive  errorless  days  of  five  trials  each  for 
acquisition  of  the  learned  response.  Extinction  for  all  Ss  was 
carried  out  using  NC  for  eight  days  (^0  trials). 

In  addition,  VTEs  (vicarious  trials  and  errors)  were  recorded 
with  the  intention  of  assessing  the  correlation  between  VTEs  and 
correct  and  wrong  responses  in  the  experimental  design.  One  VTE 
was  counted  when  a  rat  looked  up  one  alley  and  this  was  followed  by 
a  look  at  (or  final  taking-off -toward)  the  other  door  (Tolman,  1951). 
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Terminology 

A  trial  was  defined  in  terms  of  group  membership.  That  is, 
a  trial  for  Ss  in  the  C  and  R-NC  groups  was  completed  on  performance 
of  the  correct  response  and  subsequent  reinforcement.  A  trial  for 
Ss  in  NC  was  completed  upon  making  either  a  correct  or  wrong  response. 
Subjects  in  R-C  completed  a  trial  only  upon  making  a  run  from  start 
box  to  correct  goal  box  (no  wrong  responses).  An  error  was  defined 
as  a  wrong  turn  (wrong  response)  by  the  entry  of  the  whole  body  of 
the  S  exclusive  of  tail  into  the  incorrect  alley.  A  correct  response 
could  be  defined  in  two  ways:  (a)  when  a  S's  first  choice  results 
in  the  attainment  of  the  reward,  and  (b)  when  a  S  attains  reinforce¬ 
ment  regardless  of  the  number  of  choices  made  prior  to  this.  In 
the  analyses  reported  in  the  results  section,  definition  (b)  is 
used  although  an  analysis  using  definition  (a)  is  considered  in 
the  general  discussion  section.  The  termination  of  a  response 
by  the  E  (the  removal  of  the  animal  from  the  apparatus)  constituted 
a  run.  A  comparison  of  the  dependent  variables  and  training  methods 
is  shown  in  Table  1. 

Procedure 

Pretraining  and  experimental  training  were  the  same  for  the 
three  replications,  each  involving  32  Ss. 

Pretraining.  The  Ss  were  placed  on  a  23-hour  water  deprivation 
schedule  five  days  before  pretraining  commenced  and  were  maintained 
on  this  schedule  throughout  the  experiment.  During  this  period  the 
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Table  1 

A  Scoring  Comparison  of  Dependent  Variables  and 
Training  Methods  Based,  on  One  Trial. 


Dependent  Variables 

Training 

Methods 

Trials 

Errors 

Correct 

Responses 

Runs 

C 

1 

. 

0,  1  or 

x  a 

1 

1 

R-C 

1 

0,  1  or 

X 

1  or  x 

1  or  x 

NC 

1 

0  or  1 

0  or  1 

1 

R-NC 

1 

0,  1  or 

X 

1 

1  or  x 

a 

x  >1 


Ss  were  briefly  handled  each  day  by  the  E  and  then  watered  for  an  hour. 
Sixteen  Ss  were  placed  on  morning  training  and  the  remainder  on  evening 
training.  Food,  was  always  available  in  the  living  cages. 

On  the  sixth  day,  all  Ss,  four  at  a  time,  were  placed  together 
in  the  parallel  alley  or  T-maze  according  to  assignment.  Each  group 
of  four  Ss  involved  the  four  training  methods  to  acquire  one  of  the 
two  problems  in  a  particular  apparatus  design.  Free  exploration  in 
the  apparatus  with  all  guillotine  doors  open  was  allowed  for  ten 


minutes  followed  by  ten  minute  periods  in  the  living  cages  for  a 
period  of  one  hour  (three  ten  minute  sessions  in  the  maze  alternately 
with  three  ten  minute  sessions  in  individual  living  cage).  During 
the  final  ten  minute  period,  the  goal  box  doors  were  closed,  the 
drinking  cups  were  filled  with  water,  and  each  S  was  individually 
watered  in  each  goal  box  for  one  minute. 

On  the  seventh  day  each  S  in  the  group  of  Ss  to  be  run  alter¬ 
nately,  was  given  preference  trials.  In  the  case  of  the  spatial 
problem,  Ss  were  given  ten  free  trials  to  chose  the  right  or  left 
goal  box.  (A  trial,  here,  was  defined  as  the  unreinforced  choice  of 
either  the  right  or  left  goal  box. )  The  intertrial  interval  varied 
due  to  the  slowness  of  certain  Ss.  Each  S  was  tested  individually 
in  the  same  order  in  which  it  would  receive  its  training.  That  is, 
the  Ss  to  be  trained  on  the  spatial  position  problem  in  the  parallel 
alley  maze,  were  tested  individually  in  the  order  in  which  they 
would  receive  their  training--C,  R-NC,  R-C,  NC.  Similarly,  each 
S  in  the  discrimination  condition  was  given  ten  trials  to  establish 
a  preference  for  the  goal  box  with  added  or  no  added  illumination. 

The  position  of  the  added  light  was  varied  according  to  the  Gellerman 
(1933)  series  RRLLRLLRRL. 

Under  each  condition  half  the  Ss  were  to  be  trained  to  their 
preference  and  half  to  their  nonpreference.  The  decision  to  train 
a  S  to  its  preference  or  nonpreference  was  decided  by  the  flip  of  a 
coin.  If  a  S  showed  no  preference  in  the  ten  trials  (50-50)  a  coin 
was  tossed  to  establish  its  preference  (seven  Ss  out  of  9 6  or  7*3 1° 
showed  no  preference).  The  criterion  of  preference  was  six  or  more 
responses  to  the  right  or  left  goal  box,  or  to  the  lighted  or  unlighted 
goal  box,  for  Ss  in  A±  and  A 2  respectively. 
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Acquisition.  Training  commenced,  on  the  day  after  pretraining 
and.  consisted  of  reinforcing  either  the  preferred  or  nonpreferred 
goal  hox  by  two  drops  of  water  in  a  water  cup.  A  S  was  placed  in 
the  start  box  with  d^  closed  and  all  other  guillotine  doors  open. 

After  15  seconds,  d.j  was  opened.  When  a  correct  choice  was  made 
the  appropriate  door  was  closed.  If  an  error  was  committed,  the 
door  was  closed  behind  the  Ss  in  the  NC  and.  R-NC  groups.  Upon 
mailing  an  incorrect  response,  Ss  trained  by  NC  and.  R-NC  were  retained 
in  the  goal  box  for  15  seconds.  The  Ss  of  the  NC  group  were  returned 
to  their  living  cages  15  seconds  after  either  a  correct  or  incorrect 
response.  The  Ss  of  the  R-NC  group  were  returned  to  the  start  box 
for  the  rerun.  Subjects  trained  by  C  or  R-C  were  allowed  to  return 
to  the  choice  point  after  having  made  a  wrong  turn.  The  C  group 
Ss  were  returned  to  their  living  cages  upon  making  a  correct  response. 
Those  Ss  of  the  R-C  groups  were  returned  to  the  start  box  for  the 
rerun  and  returned  to  their  living  cages  upon  making  an  errorless 
run. 

Training  was  carried,  out  individually  in  groups  of  four  Ss-- 
C  (correction),  R-NC  (rerun  noncorrection),  R-C  (rerun  correction), 
and  NC  (noncorrection),  with  Ss  to  learn  the  position  habit  (A^)  in 
the  parallel  alley  followed  by  Ss  in  the  T-maze.  Then  another  four 
Ss  were  individually  trained  in  the  same  order  on  the  discrimination 
problem  (A2)--four  Ss  in  the  parallel  alley  followed  by  four  in  the 
T-maze.  Five  trials  per  S  were  run  each  day  with  the  intertrial 
interval  kept  proportional  over  days.  That  is,  the  intertrial  interval 
was  longest  on  the  first  day  becoming  progressively  shorter  as  learning 
progressed  with  a  minimum  of  five  minutes. 
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The  Ss  in  group  (positi.on  habit)  were  given  no  additional 
cue  of  goal  box  differences  (no  added,  light  in  the  positive  box). 

The  Ss  were  trained  according  to  their  preference  or  nonpreference 
of  a  goal  box  position.  Subjects  in  A2  condition  were  trained 
according  to  their  preference  or  nonpreference  of  an  added  light 
in  the  positive  goal  box.  The  added  light  was  on  in  the  goal  boxes 
according  to  the  series --RLLRR  on  odd  training  days  and.  RLRLL  on 
even  days.  If,  for  example,  a  S  was  being  trained,  to  a  preference 
or  nonpreference  of  added  light,  the  reward  on  the  first  trial  of 
day  one  was  on  the  right,  otherwise,  the  reward  was  on  the  left. 

The  running  time  (seconds)  was  recorded  from  the  opening  of  d^  to 
a  S  making  a  choice  (right  or  wrong)  for  all  Ss  under  all  conditions. 

Extinction.  After  each  S  met  the  criterion  of  two  consecutive 
errorless  days,  extinction  commenced  in  all  conditions.  The  extinc¬ 
tion  method  was  NC  for  all  Ss  regardless  of  their  original  groups. 
There  was  no  reward  and  each  S  ran  five  trials  per  day  for  eight 
days  ( kO  trials).  The  running  time  from  opening  d  to  a  S  making  a 
choice,  right  or  wrong  was  recorded,  for  all  Ss. 
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Results 

In  order  to  facilitate  the  coherence  of  this  presentation  of 
the  results,  the  data  will  be  presented  in  terms  of  the  following 
dependent  variables  under  the  major  headings  of  acquisition  and 
extinction:  trials,  errors,  correct  responses,  and  runs.  The 
questions  posed  in  the  introduction  will  be  considered  in  the 
discussion.  In  addition,  all  interactions  are  shown  graphically 
in  the  appendix. 

Acquisition .  The  running  speeds  from  start  box  to  making 
a  choice  during  acquisition  showed  no  significant  differences  and 
no  further  consideration  of  these  will  be  made  here.  The  means  and 
standard  deviations  of  the  four  dependent  variables,  trials,  errors, 
correct  responses,  and  runs  during  acquisition  of  a  position  habit 
(Aq)  and  discrimination  learning  (A2)  are  given  in  Table  2.  The 
analyses  of  variance  were  computed  using  orthogonal  comparisons 
(oC  =  .05  one-tailed  F  test).  The  data  for  all  conditions  were 
subjected  to  test  (cxf  =  .05)  with  no  indication  of  hetero- 

geneity. 

Trials.  In  terms  of  the  definitions  used  here,  a  trial  was 
equivalent  to  a  correct  response  (reinforcement)  and  a  run  in  C; 
a  run  in  NC;  and  a  correct  response  in  R-NC  (Table  2).  The 
acquisition  of  the  position  habit  required  significantly  fewer 
trials  to  reach  the  criterion  of  learning  than  the  brightness 
discrimination  task  (p  ^  .001)  (Table  3).  The  only  other  signi¬ 
ficant  main  effect  found  was  that  training  to  preference  was  more 


efficient  than  training  to  nonpreference  (p  .01).  A  test  of 
the  D  x  T  interaction  showed  a  significant  difference  "between  C 
and  R-C  only  (p  <  .05). 
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Table  2 


Means  and  Standard  Deviations  for  Trials,  Errors,  Correct  Responses, 
and  Runs  to  Criterion  for  Training  Methods  in  Acquisition  of  a 
Position  Response  (Aj_)  and  Brightness  Discrimination  (A2) 


Training 

Methods 

Ai 

or 

a2 

— 

Trials 

Errors 

Correct 

Responses 

Runs 

Mean 

SD 

Mean 

SD 

Mean 

SD 

s 

Mean 

SD 

C 

A1 

a2 

19.55 

47.05 

14.98 

20.61 

11.75 

29.60 

13.53 

14.98 

19.55 

47.05 

14.98 

20.61 

19.55 

47.05 

14.98 

20.61 

R-C 

ai 

a2 

18.30 

38.75 

12.49 

13.84 

13.25 

36.10 

10.17 

9.09 

28.85 

70.85 

18.37 

22.52 

28.85 

70.85 

18.37 

22.52 

NC 

A1 

A2 

25.00 

36.67 

7.83 

16.80 

10.75 

16.85 

4.11 

7.28 

14.25 

19.60 

3.85 

10.55 

25.00 

36.67 

7.83 

16.80 

| 

R-NC 

A1 

a2 

16.67 

35.^5 

3.91 

22.20 

9.15 

20.90 

4.24 

13.81 

16.67 

35.50 

3.91 

22.20 

f 

1 

25.83 |  6.99 
» 

56.40,37.04 

- i - 
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Table  3 

Analysis  of  Variance  of  the  Trials  to  Criterion 

in  Acquisition 


df 

MS 

F 

Apparatus  (D) 

1 

176.04 

Problem  (A) 

1 

9204.17 

48.96*** 

Training  Methods  (T) 

3 

284.03 

l.4l 

Preference  (P) 

i 

1 

1350.00 

7.18** 

D  x  A 

1 

- 

D  x  T 

3 

413.19 

2.20 

D  x  P 

1 

37.50 

A  x  T 

3 

189.24 

1.01 

A  x  P 

1 

234.37 

1.25 

T  x  P 

3 

46. 53 

D  x  A  x  T 

3 

54.86 

-  -  -  • — > 

D  x  A  x  P 

1 

234.37 

1.25 

D  x  P  x  T 

3 

84.03 

A  x  P  x  T 

3 

1.15 . 60 

D  x  A  x  T  x  P 

3 

240.63 

1.28 

Treatment 

31 

500 . 67 

2 . 66*** 

Replications 

2 

1932.67 

Error 

62 

187 . 99 

**p  ^  .01 

***p  <.001 
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Errors .  The  analysis  of  variance  of  the  error  data  demon¬ 
strated  that  the  acquisition  of  a  position  habit  differed  signifi¬ 
cantly  from  the  discrimination  task  (p  <  .001)  (Table  4-).  The 
apparatus  design  was  not  found  to  be  a  significant  factor  in  terms 
of  errors  but  preference  was  found  significant  beyond  the  .05 
level.  The  interaction  of  tasks  and  training  methods  (A  x  T) 
and  the  three-way  interaction  of  apparatus  design  by  problems  by 
training  methods  (D  x  A  x  T)  showed  significance  beyond,  the  .05 
level.  An  examination  of  the  orthogonal  comparisons  showed  a 
significant  difference  (p  .001)  between  combined  correction  methods, 
Cc,  where  Cc  =  C  +  R-C,  and.  the  combined  noncorrection  methods, 

NCc  =  NC  +  R-NC.  That  is,  the  C  method  plus  R-C,  was  not  as 
efficient  in  training  Ss  as  was  the  NC  method,  including  R-NC. 

This  was  particularly  true  in  learning  the  discrimination  problem 
in  both  parallel  alley  and  T-mazes.  This  finding  was  further 
confirmed  in  the  problem  by  training  method  (A  x  T)  and  apparatus 
by  problem  training  method  (D  x  A  x  T)  interactions  (p<(  .01).  In 
A  x  T  and  D  x  A  x  T,  the  NC  group  showed,  faster  learning  of  the 
discrimination  task  (A2),  whereas  R-NC  was  more  efficient  in  learning 
the  position  habit  (Aq). 

Correct  responses-~reinforcements .  The  analysis  of  variance 
of  correct  responses  as  reinforcements,  showed,  all  main  effects 
except  apparatus  design  significant  (p  \  .001)  (Table  5)-  Exami¬ 
nation  of  the  orthogonal  comparisons  in  the  main  effects  showed 
NCc  to  be  more  efficient  than  Cc  with.  C  significantly  more  efficient 
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Table  4- 

Analysis  of  Variance  of  Errors  to  Criterion 

in  Acquisition. 


df 

MS 

F 

Apparatus  (B) 

1 

130.67 

1.4-8 

Problems  (A) 

1 

4-4-77 . 13 

50.77*** 

Training  Methods  ( T ) 

3 

614-.75 

6.97*** 

Preference  (P) 

1 

601.04- 

6.92* 

D  x  A 

1 

35. 0^ 

D  x  T 

3 

133.36 

1.51 

B  x  P 

1 

63.38 

A  x  T 

3 

317.79 

3  •  60* 

A  x  P 

l 

24-.  00 

P  x  T 

3 

43 . 37 

_  1 

B  x  A  x  T 

3 

380.29 

4-.  31** 

B  x  A  x  P 

1 

253.50 

2.87 

B  x  P  x  T 

3 

51.07 

A  x  P  x  T 

3 

73.  Ml- 

B  x  A  x  P  x  T 

3 

159.58 

1.81 

Treatments 

31 

352.08 

3 . 99*** 

Replications 

Error 

2 

62 

1007.87 

88.18 

*P  <1  .05 

**p  <_  .  01 

***p  <  .001 


. 
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Table  5 

Analysis  of  Variance  of  Correct  Responses  to  Criterion 

in  Acquisition 


df 

MS 

F 

Apparatus  (D) 

1  ! 

283.59 

1.87 

Problems  (A) 

1 

12903.84 

85.20*** 

Training  Methods  (T) 

4547.35 

30.02*** 

Preference  (P) 

! 

l 

2044.26 

13 . 50*** 

D  x  A 

l 

8.76 

D  x  T 

3 

523.17 

3  •  '+5* 

D  x  P 

l 

36.26 

A  x  T 

3 

1447.59 

9. 56*** 

A  x  P 

1 

15.84 

P  x  T 

3 

85.39 

D  x  A  x  T 

3 

1366.87 

9 . 02*** 

D  x  A  x  P 

1 

536.76 

3.54 

D  x  P  x  T 

3 

87.28 

A  x  T  x  P 

J 

223.37 

1.47 

D  x  A  x  T  x  P 

3 

79.82 

Treatments 

31 

1319.73 

8.71*** 

Replications 

2 

2519.88 

Error 

62 

151.46 

*p  c  *°5 


-x-x-x-p  „  001 


- 
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than  R-C  (p  <T  .001).  In  addition,  NC  was  found  to  be  significantly 
superior  to  R-NC  in  all  instances.  The  task-training  method 
interaction  (A  x  T)  demonstrated  that  NCc  was  superior  in  the 
learning  of  both  problems  and  noticeably  superior  in  learning 
the  brightness  discrimination  task  (p  .001).  Both  problems 
were  learned  by  all  Ss  in  the  noncorrection  and  correction  groups. 
However,  there  were  fewer  reinforcements  required  by  the  non- 
correction  methods  to  reach  the  criterion  of  learning  than  by  the 
correction  methods.  Little  difference  between  training  methods 
occurred  in  learning  the  position  habit  in  the  parallel  alley  and 
T -mazes.  However,  the  D  x  A  x  T  interaction  demonstrated  differences 
between  Cr  and  NC  in  the  parallel  alley  in  learning  the  discrimi- 
nation  task  (p  <  .001).  This  same  interaction  demonstrated  marked 
differences  between  R-C  and  NC  and  little  difference  between 
C  and  R-NC  in  the  T-maze  in  learning  the  discrimination.  In 
addition,  the  apparatus  design  by  training  method  (D  x  T) 
interaction  showed  a  significant  difference  between  C  and  R-C 
(p  <  .01)  with  R-C  requiring  the  most  reinforcements  to  reach 
the  criterion  of  learning.  NC  was  shown  to  be  the  superior  method 
in  terms  of  reinforcements. 

Runs.  The  analysis  of  variance  demonstrated  that  the 
discrimination  task  required  more  runs  to  reach  the  criterion 
of  learning  than  the  position  habit  (Table  6).  As  for  the 
previous  three  dependent  variables,  no  significant  differences 
between  the  parallel  alley  and  T-maze  designs  were  obtained. 
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Table  6 

Analysis  of  Variance  of  Runs  to  Criterion 
in  Acquisition 


- f 

df  ! 

i 

MS  { 

F 

■j 

] 

Apparatus  (D) 

J 

1 

29^.00 

f 

Problems  (A) 

i 

19410 . 07 

62.39*** 

Training  Methods  (T) 

3 

1525.99 

4.90** 

Preference  (P) 

( 

1 

2562.67 

8 . 24** 

D  x  A 

1 

60.17 

D  x  T 

3 

566. 44 

1.82 

D  x  P 

l 

240 . 67 

A  x  T 

3 

942.11 

3.03* 

A  x  P 

1 

228.17 

P  x  T 

3 

44.97 

D  x  A  x  T 

3 

148.18 

D  x  A  x  P 

1 

450.67 

1.45 

D  x  P  x  T 

3 

344.80 

l.ll 

A  x  T  x  P 

3 

379.25 

1.22 

D  x  A  x  T  x  P 

3 

409.52 

1.32 

Treatment 

31 

1171.94 

3 . 77*** 

Replications 

2 

3551.85 

Error 

62 

311.11 

*P  <  -05 

**p  < .01 
***p  <.001 
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However,  the  other  three  main  effects  reached  significance 
(p  «(  .01).  An  examination  of  the  orthogonal  comparisons 
showed  that  C  was  significantly  superior  to  R-C  in  terms  of 
runs  (p  <  .01).  The  A  x  T  (problems  by  training  methods) 
interaction  showed  that  the  NC^  methods  were  overall  superior 
to  Cc  in  terms  of  runs.  For  learning  the  position  habit,  C 
was  the  superior  method  while  NC  was  more  efficient  in  learning 
the  discrimination  task. 

Extinction.  The  NC  method  was  used  in  the  extinction  of 
the  acquired  habits.  This  made  the  error  score  and  the  correct 
responses  together,  equivalent  to  the  number  of  runs  or  trials 
(^0  runs  or  trials).  The  error  scores  are  presented  in  Table  7 
and  no  significant  differences  were  found  (Fs<^  l). 

The  running  speeds  from  start  box  to  making  a  choice  were 
recorded  on  all  trials.  The  mean  reciprocal  values  of  the  time 
in  seconds  were  plotted  against  blocks  of  five  trials  (days) 

(Fig.  5  and  Fig.  6).  Several  more  graphs  can  be  found  in  the 
appendix  (Figs.  7>  8,  and  9)*  In  Fig*  5>  a  comparison  of  the 
running  speeds  for  Ss  and  Cc  and  NCc  in  the  extinction  of  a 
position  habit  and.  brightness  discrimination  in  the  T-maze  is 
made.  By  the  last  day  there  was  little  difference  between  the 
running  speeds  of  the  different  groups  although  the  Ss  of  Cc  were 
extinguishing  the  acquired  position  habit  faster  than  the  other 
groups.  The  extinction  running  times  in  the  parallel  alley  maze 
(Fig.  6)  show  the  Ss  in  NCc  groups  for  both  tasks  extinguishing 
the  acquired  habit  more  quickly  than  those  in  the  CQ  groups.  There 
were  no  significant  differences  between  groups. 
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Table  7 


Means  and  Standard  Deviations  of  Error 
Scores  in  Extinction3- 


Method 

Problem 

Mean 

— 

SD 

Ai 

25.25 

5.36 

C 

A2 

22. 60 

4.51 

A1 

24.00 

4.6l 

R-C 

A2 

23.00 

4.81 

h 

26.67 

3.58 

NC 

A2 

23.60 

4.17 

ai 

24.80 

2.92 

R-NC 

A2 

22.75 

3-66 

40  NC  trials  at  5  trials/ day 
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BLOCH'S  Of  S’  TftALS  (PAYS) 


Fig.  5.  Combined,  correction  (Cc)  and.  combined  noncorrection  (NCc) 

methods’  in  extinction  of  a  position  habit  (Ap)  and  discrimi¬ 
nation  problem  (A2)  in  the  T-maze. 
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Fig.  6.  Combined  correction  (C^)  and  combined  noncorrection  (NC<?) 

methods  in  extinction  of  a  position  habit  (A-])  and  discrimi¬ 
nation  problem  (A2)  in  the  parallel  alley  maze. 
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VTEs.  Although  not  considered  a  formal  dependent  variable, 
in  terms  of  design,  the  behavior  of  the  rats  at  the  choice  point 
was  recorded  in  number  of  VTEs.  In  Table  8,  the  correlations  of 
VTEs  with  errors  and  correction  responses  in  acquisition  is  shown 
Two  significant  correlations  (p  .05),  the  correlations  between 
errors  and  VTEs  in  the  parallel  alley  maze  using  the  discrimination 
task,  were  found.  The  correlations  of  correct  responses  with  VTEs 
under  NC  (n  =  6  per  cell)  were  all  significant  (p  <  .05).  Under 
preference  factors,  the  correlation  between  VTEs  and  errors  (r  = 

.62  with  n  =  12)  was  the  only  significant  result.  Nonpreferred 
training  in  the  T-maze  with  the  position  task  showed  significant 
correlations  for  both  errors  and  correct  responses.  Additional 
correlations,  which  are  not  central  to  the  problems  of  this 
experiment,  may  be  found  in  the  appendices  (Tables  2  and.  3)- 
In  extinction,  the  correlations  between  errors  and  VTEs 
were  calculated  and  only  one  significant  r- value  (r  =  .93)  was 
obtained.  These  were  obtained  by  the  I -NC  group  learning  the 
position  task  in  the  T-maze. 


Correlations  between  Errors,  Correct  Responses  and  VTEs  for  Training  and  Preference  Faclor 
under  Position  Habit  and  Discrimination  Acquisition  in  the  Parallel  Alley  and  T-mazes 
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Discussion 

ihe  purpose  of  this  paper,  as  outlined  in  the  introduction,  was 
to  investigate  and  evaluate  the  relative  efficiency  of  the  four 
training  methods,  C,  R-C,  NC,  and  R-NC,  under  different  conditions. 
This  discussion  will  try  to  answer  the  questions  posed  in  terms 
of  the  experimental  results. 

A  significant  difference  between  the  training  methods  was  found 
in  all  dependent  variables  except  trials  with  NC^  generally  superior 
1°  methods.  Even  under  trials,  a  trend  toward,  fewer  learning 
trials  using  R-NC  can  be  seen  in  Table  2.  This  supports  the 
findings  of  Kalish  (1946)  as  does  the  error  data  for  A  .  In  his 
report,  Kalish  (1946)  stated  there  was  no  evidence  of  the  Ss  of  the 
correction  group  learning  the  problem.  In  contrast  to  this,  all 
the  Ss  in  the  present  study  reached  the  criterion  of  learning. 

Muenzinger  and  Powloski  (1951)  'found  C  significantly  better 
than  R-NC  in  a  light/ dark  discrimination  situation  using  a  T-maze. 

In  the  brightness  discrimination  task  in  this  study,  a  similar  trend 
was  found  in  the  T-maze  situation  in  terms  of  mean  error  scores. 
However,  these  results  were  not  significant.  In  contrast,  the  mean 
error  scores  in  the  parallel  alley  maze  for  the  brightness  discrimi¬ 
nation  task  show  significant  differences  between  C/R-C  and  R-NC/NC 
with  the  latter  two  methods  superior.  These  findings  support  the 
suggestion  made  by  Muenzinger  and  Powloski  (1951)  that  different 
learning  records  might  be  obtained  using  a  parallel  alley  maze.  They 
reasoned  that  C  might  become  less  effective  in  the  parallel  alley 
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maze  because  the  Ss  in  C  would  have  to  turn  around  the  choice  point 
and  face  the  left/right  choice  after  an  error,  whereas  in  the  T-maze, 
they  could  keep  moving  in  a  straight  line  past  the  choice  point  to 
the  rewarded  arm. 

The  results  reported  by  Towart  and  Boe  (1965),  in  terms  of  trials 
and  errors,  showed  no  significant  difference  between  C  and  R-NC  in 
learning  a  position  response  in  a  parallel  alley  maze.  These 
findings  were  supported,  by  this  study  (Table  2;  Appendices,  Fig.  la 
and  2a).  The  rerun  noncorrection  (R-NC)  procedure  appeared  to  be 
more  efficient  then  the  correction  methods  in  the  acquisition  of  a 
position  in  a  parallel  alley  maze  in  terms  of  all  dependent  variables 
except  runs  and  NC  more  efficient  except  in  terms  of  trials  and  runs. 

It  is  possible  that  the  selection  of  a  training  method  could  be  left 
to  the  discretion  of  the  E  when  using  the  parallel  alley  in  a  similar 
situation  as  the  methods  show  little  variation.  However,  it  is  likely 
that  one  of  the  noncorrection  procedures  would  provide  faster  learning. 

The  superiority  of  NC  over  C  in  terms  of  trials,  errors,  and  correct 
responses  in  a  T-maze  for  both  tasks--the  position  response  and  bright¬ 
ness  discrimination,  lend  support  to  the  work  of  Besch  et  al  (1963), 
Kalish  (19^-6),  Parducci  and  Polt  (1958),  and  Seward  ( 19^3) 

Within  the  terms  defined,  by  this  study,  the  noncorrection  methods 
produced  more  efficient  acquisition  of  both  tasks  in  the  two  maze 
designs.  Although  the  differences  between  training  methods  in  learning 
a  position  response  in  both  mazes  are  small,  large  differences  occur 
in  making  a  brightness  discrimination.  In  view  of  this,  the  training 
method  selected,  in  a  brightness  discrimination  situation  should,  be 
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carefully  considered  in  terms  of  the  dependent  variable  to  be  used 
and  the  apparatus  design. 

A  possible  explanation  for  the  efficiency  of  the  noncorrection 
procedures  involves  the  concept  of  behavior  sequences  (Kalish,  1946). 
The  S  learning  in  a  maze  by  one  of  the  correction  methods  is  faced 
with  two  alternatives:  (a)  to  take  the  long  path- -to  make  one  or 
more  wrong  responses  followed  by  the  correct  response,  or  (b)  to 
take  the  short  path- -to  make  the  correct  response.  In  this  behavior 
sequence,  a  S  could  receive  positive  reinforcement  of  the  wrong 
response  by  choosing  the  long  path.  In  the  noncorrection  methods, 
the  S  makes  one  of  two  different  responses --an  error  or  a  correct 
choice.  Thus,  there  are  two  distinct  behavior  sequences  in  non¬ 
correction  with  no  reinforcement  of  the  wrong  choice.  This  separa¬ 
tion  of  correct  and  incorrect  responses  may  be  an  important  factor 
in  learning  situations  (Hill,  Cotton,  &  Clayton,  1962) . 

The  apparatus  design  appeared  to  have  little  influence  upon 
learning  in  terms  of  trials  and  runs.  However,  design  did  appear 
to  have  some  effect  upon  acquisition  according  to  error  and  correct 
response  scores.  According  to  the  results  of  this  investigation, 
the  use  of  either  apparatus  design  in  a  similar  position  response  pro¬ 
blem  should  have  little  influence  on  the  training  procedure  selected. 
However,  in  the  case  of  the  discrimination  task,  the  parallel 
alley  maze  results  showed  the  C/R-C  and  NC/R-NC  methods  differed 
slightly  with  a  significant  difference  between  the  correction  and 
noncorrection  methods.  In  contrast,  the  T-maze  results  for  discri¬ 
mination  indicated  little  disparity  between  C  and.  R-NC  with  a 


significant  difference  between  R-C  and  NO.  In  the  case  of  discri¬ 
mination  problems,  the  selection  of  NC  should  prove  more  efficient 
in  either  apparatus  design. 

The  acquisition  of  a  position  habit  was  found  to  be  signifi¬ 
cantly  different  from  learning  a  brightness  discrimination.  In 
addition,  the  interaction  of  tasks  and.  training  methods  was  found 
in  terms  of  errors,  correct  responses  and  runs.  Little  dissimilarity 
between  training  methods  was  demonstrated  in  the  learning  of  a 
position  response  but  marked  slower  learning  was  found  when  C  or 
R-C  was  used  in  the  discrimination  task.  The  results  of  this 
experiment  lend  support  to  the  findings  of  McKelvey  (1956)  who 
found  NC  more  effective  in  learning  a  brightness  discrimination 
using  a  T-maze.  Moreover,  support  was  given  to  studies  using 
different  discrimination  tasks  which  found  NC  to  be  the  better 
procedure  (Besch  et  al. ,  1963;  Wischner,  19^7;  Wischner,  Fowler, 

&  Kushnick,  1963). 

Direct  comparison  of  the  number  of  reinforcements  received 
by  the  Ss  was  only  possible  between  C  and  R-NC.  In  this  experi¬ 
ment,  Ss  in  R-NC  showed  better  performance  except  in  the  T-maze 
discrimination  task  where  the  C  group  had  a  slight  (nonsignificant) 
advantage.  Since  correct  responses  were  defined  as  reinforcements, 
the  overall  picture  shows  the  NC  method,  in  which  the  Ss  received 
the  least  number  of  reinforcements,  to  be  superior  in  both  problems. 
These  findings  corroborate  those  of  Hill  et  al  (1962)  in  which  they 
showed,  that  even  though  NC  Ss  received,  half  as  many  reinforcements 
as  those  in  the  C  group,  acquisition  was  superior  by  NC.  They 
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suggested  that  these  findings  gave  further  support  to  the  improved 
efficiency  of  learning  a  discrimination  with  a  combination  of 
rewarded  and  nonrewarded  trials  than  with  all  trials  rewarded. 

North  (1950a;  1950b)  also  showed  that  NC  learned  the  given  task 
with  fewer  reinforcements  than  C  even  though  he  could  not  demon¬ 
strate  any  significant  differences  between  C  and  NC.  The  present 
experimental  findings  further  imply  that  the  nonreward  of  incorrect 
responses  is  an  important  consideration  in  similar  learning  situations. 

By  defining  a  correct  response  as  a  S ’ s  first  choice  which 
terminates  in  the  attainment  of  the  reward,  nonsignificant  results 
for  all  main  effects  and  interactions  were  found  in  the  present 
experiment.  (The  analysis  of  variance  results  can  be  examined 
in  the  appendix.)  With  correct  responses  recorded  as  first  choices, 
the  actual  number  of  reinforcements  a  S_  obtains  is  ignored.  In  this 
investigation,  the  number  of  reinforcements  and  their  influence 
in  learning  is  of  crucial  importance.  This  kind  of  analysis  points 
out  the  obvious  importance  of  definitional  considerations  not  only 
in  the  interpretation  of  results,  but  in  the  actual  collection  of 
data. 

When  Seward  (1943)  found  NC  superior  to  C  after  forced  trials 
to  each  side  of  the  maze,  he  speculated  on  the  influence  of  prefe¬ 
rence  bias.  He  suggested  that  the  lack  of  difference  between  C 
and  NC,  found  by  Hull  and  Spence  (1938),  might  have  been  due  to 
training  Ss  to  their  preferred  direction.  In  1956,  McKelvey  gave 
the  rats  in  pretraining,  two  free -choice  runs  followed  by  two 
forced  runs  to  counterbalance  position  habits.  In  interpretation 
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of  his  results ,  he  suggested  that  NC  tended  to  reduce  position 
habits  more  effectively  than  C.  In  the  present  study,  ten  prefe¬ 
rence  trials  were  given  to  all  Ss  with  half  the  Ss  trained  to 
their  preference  and  the  remainder  to  their  nonpreference.  The 
results  showed  a  significant  difference  between  training  to  pre¬ 
ference  and  training  to  nonpreference  which  agrees  with  past 
findings  (Brogden,  1951)*  However,  even  though  a  S  will  learn 
faster  when  trained  to  its  preference,  this  factor  did  not  influence 
the  training  methods  used  under  the  different  conditions.  There¬ 
fore  within  the  conditions  of  the  present  experiment,  it  would 
appear  that  preference  facilitates  learning  but  has  negligible 
influence  on  training  procedures  and  tasks. 

Extinction  scores  showed  little  difference  between  training 
methods  on  either  of  the  tasks.  This  implies  that  the  extinction 
process  was  little  influenced  by  training  methods  in  the  experi¬ 
mental  conditions.  The  running  speeds  showed  that  extinction  of 
the  original  learned  tasks  was  occurring  as  all  Ss  were  running 
slower  by  the  last  day.  The  running  speed  differences  by  the 
eighth  day  were  nonsignificant  between  the  training  groups.  The 
comparison  of  training  methods  in  acquisition  followed  by  extinction 
has  not  been  undertaken  before. 

The  correlational  analyses  of  VTEs  in  the  present  investigation 
can  be  interpreted  as  the  expression  of  a  specific  tendency  (Tolman  & 
Ritchie,  1943),  e.g.,  the  positive  correlation  between  correct  responses 
and  VTEs  across  all  conditions  when  using  the  NC  method  in  acquisition. 
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This  suggests  that  the  relationship  between  reinforcements  and 
VTEs  in  acquisition  may  have  influenced  learning  by  NC.  The 
correlations  found  between  errors  and  VTEs  in  extinction  were 
not  significant. 
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Summary  and  Conclusions 

An  experiment  was  performed  to  determine  the  relative  merits 
of  correction,  rerun  correction,  noncorrection,  and  rerun  noncor¬ 
rection  training  methods  in  a  parallel  alley  maze  and  T-maze 
using  the  acquisition  of  a  position  habit  and  the  determination 
of  a  brightness  discrimination  as  problems.  In  addition,  preference 
versus  non-preference  training  was  considered. 

The  results  support  the  following  conclusions: 

1.  There  was  a  significant  difference  between  training  methods 
on  all  defined,  dependent  variables  except  trials.  The  NCc  methods 
were  found  to  be  more  efficient  under  all  conditions. 

2.  Apparatus  design  interacted  significantly  with  tasks  and 
training  procedures  in  terms  of  error  and  correct  response  scores. 

The  correction  procedures  were  found  to  be  less  efficient  than 
the  noncorrection  procedures  in  the  parallel  alley  maze.  In  the 
T-maze  and  the  discrimination  problem,  little  difference  was 
found  between  C  and  R-NC,  but  a  significant  difference  was  found 
between  R-C  and  NC.  It  would  therefore  seem  that  one  of  the  non¬ 
correction  procedures  should  be  most  efficient  in  both  mazes-- 
particularly  in  the  discrimination  task. 

3.  There  was  a  significant  difference  between  learning  a 
position  response  and  determining  a  brightness  discrimination 
with  the  latter  task  requiring  a  longer  time  to  learn.  There 
was  little  difference  found  between  training  methods  in  learning 

a  position  response.  The  brightness  discrimination  task  differentiated 
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the  training  methods  more  clearly  and  NC  was  found,  to  be  the  most 
efficient  procedure  in  the  T-maze. 

Correct  responses  were  equivalent  to  reinforcements  in  all 
methods.  Despite  the  fact  that  it  received  the  least  number  of 
reinforcements,  NC  was  shown  to  be  the  superior  method,  on  both 
problems . 

5.  Training  to  preference  facilitated  learning  but  did  not 
influence  training  methods. 

6.  There  were  no  significant  findings  in  extinction.  All 
Ss  were  extinguishing  the  learned  habit  by  the  eighth  day  as  was 
shown  by  the  slower  running  speeds. 

7.  The  positive  correlation  between  VT'Es  and  correct 
responses  in  NC  under  all  conditions  suggest  that  a  relationship 
existed  between  the  number  of  reinforcements  and  VTEs  and  that 
this  relationship  may  have  facilitated  learning. 

The  use  of  rigorously  defined  training  methods  and.  dependent 
variables  has  made  available  a  basis  on  which  to  compare  future 
studies.  In  addition,  studies  which  reached  different  conclusions 
on  similar  problems  and.  utilized,  different  training  methods,  could 
be  reconsidered  in  the  light  of  these  findings.  The  use  of  one 
training  method  (NC)  in  the  extinction  of  the  acquired  habits, 
did  not  show  significant  differences  between  the  training  methods 
used  in  acquisition.  It  would  appear  that  the  noncorrection  methods 
are  to  be  preferred  in  most  maze  learning  situations. 
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Fig.  1.  The  A  x  T  interaction  in  terns  of  mean  error  scores. 
A-i  =  position  problem;  kz  =  discrimination  task. 

(a)  The  training  methods  are  considered  separately. 

(b)  The  training  methods  are  combined. 
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Fig.  3. 


The  D  x  T  interaction  in  terms  of  correct  responses. 
Fp  =  parallel  alley  maze;  I>2  =  T-maze. 
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Fie  4.  The  A  x  T  interaction  in  terms  of  mean  correct  responses 
A  =  position  problem;  A2  =  discrimination  task. 
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APPENDIX  10 
Table  1 

Analysis  of  Variance  of  First  Choices  as  Correct  Responses 

in  Acquisition 


df 

MS 

F 

Apparatus  (D) 

1 

130.67 

Problems  (A) 

1 

1617.04 

2.05 

Training  Methods  (T) 

3 

71.37 

Preference  (P) 

l 

66.67 

D  x  A 

1 

48.17 

D  x.  T 

3 

58.50 

D  x  P 

1 

7.04 

A  x  T 

3 

28.38 

A  x  P 

1 

10.67 

P  x  T 

3 

35.39 

D  x  A  x  T 

3 

23.56 

D  x  A  x  P 

1 

3.38 

D  x  P  x  T 

3 

36.93 

A  x  P  x  T 

3 

57.90 

D  x  A  x  P  x  T 

3 

7-53 

Treatments 

31 

916.81 

1.16 

Replications 

2 

379.08 

Error 

62 

789.25 
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APPENDIX  11 

Table  2 


Correlations  Between  Errors,  Correct  Responses  and  VTEs  for 
Combined  Correction  (Cc)  and  Noncorrection  (NCc)  Methods  Under 
Acquisition  of  a  Position  Habit  and  Discrimination  in  the 

Parallel  Alley  and.  T -Mazes 


Training 

Method 

Combined 

Problems^ 

Apparatus'^ 

Position 

Response 

Discrimi¬ 

nation 

Parallel 

Alley 

T-Maze 

E. 

C.R." 

E. 

C.R. 

E. 

C.R. 

E. 

C.R. 

Cc 

.21 

*.45 

.32 

.22 

*.69 

*.50 

*.56 

*.56 

NCc 

.12 

*.47 

*.47 

*.37 

*.70 

*.63 

*.45 

•  30 

*P<  -05 

Xn  -  24 


E  =  Error 


C.R.  =  Correct  Reponse 
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APPENDIX  12 

Table  3 


Correlations 
Problems  (Aj_ 


Between  Errors  and  VTEs  in  Extinction  and.  Learned 
and  Ag)  under  Training  Methods  and  Preference  Factors 
in  the  Parallel  Alley  and  T-Mazes. 


Problem 

Apparatus 

Training  Methods^ 

p 

Preference^ 

C 

R-C 

NC 

R-NC 

Preferred 

Non- 

Preferred 

Position 

Response 

% 

Parallel 

Alley 

.22 

.42 

-.26 

.27 

- .  06 

•  50 

T 

.11 

.70 

-.22 

*.93 

i — 1 

CO 

•  39 

Discri¬ 

mination 

A2 

Parallel 

Alley 

-.11 

.05 

-.17 

-.53 

.00 

-.23 

T 

-.46 

-.16 

- .  6l 

.44 

.08 

- - 

CO 

CM 

1 

An  =  6  per  cell 
^n  =  12  per  cell 


*P  <  .05 


